|
|
Accession Number |
TCMCG075C26482 |
gbkey |
CDS |
Protein Id |
XP_017983191.1 |
Location |
join(7976657..7976779,7977235..7977352,7977549..7977626,7980387..7980505,7980610..7980681,7981814..7981975,7982127..7982196,7982296..7982474,7982736..7982900,7982989..7983060,7983144..7983273,7983354..7983776,7984310..7984686,7984992..7985069,7985992..7986000) |
Gene |
LOC18588893 |
GeneID |
18588893 |
Organism |
Theobroma cacao |
|
|
Length |
724aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018127702.1
|
Definition |
PREDICTED: large proline-rich protein BAG6 isoform X1 [Theobroma cacao] |
CDS: ATGGGAAGCACTGGTGCTGATAAAGTTCCAAGAGATAGTGAAACTGAAGGTTCTGAGACCACAATAGAAATAAAAATAAAAACACTGGACTCTCAGACTTATACTTTGAGAGTAGATAAACAGATGCCAGTGCCTGCACTAAAAGAACAGATTGCTTCTGTAACTGGTGTGTTATCAGAGCAACAACGACTAATATGTCGTGGGAAAGTTCTAAAGGATGACCAACTACTTTCTGCTTACCATGTCGAGGATGGTCACACATTGCACATGGTGGTCAGGCAGCCGGTTCCACCATCATCTGATGGCTCACCTCATTCAGCAAATGATTCTGCATCAGGTACAAGCCGTGGTCACAGTAATCATGTAGCCCCTAGTGTTGTGATAGAAACTTTTAATGTGCCTGATCAAGGGGATGGAGTTCCTCCTGAGATCAGTCGGATTGTCTCTGCCGTTCTTGGCTCTTTCGGATTTGCAAATATAGGAAGTGGAAATATTGGGGGTGATGTCAGGGAACATGGTTCACAAAGACTTGAAAGAACATCTGGTGCTAGTGGCATGCCAGATTCATCCCAAGCTCAAACGGAACAAGCTAGCATGAGGGGTCAATCTGATAGAGTACATAGTGCTTTTGGACTTCCAGCAGCAGTTTCCTTGGGGCCTCTGCAACCTCCTGTTATTCCTGATTCTTTGGCAACTTTGTCCCAATATCTGAGTCATTTACGACGTGAATTTGATGGCATTGGTAGAGCTGGGGGAGAGGATCCTCAGGCAGCATCCTTGAGTAGGACTGGAGATAGAGATTCTAACCCTGCATCAAATTCAGGGACTGTACAGGAAGGTCTTCCAACACCTGCATCTTTGGCAGAAGTGTTACTCGCTACCAGACAACTGCTCATTGAACAAGCTGGAGAATGTCTACAACAACTTGCAAGGCAACTGGAGGATCAAGGAAATGTGACGGACTCCTCAGCACGGCTGAGTGCACAGTCCATTGCTTGGAGAACTGGAGTTCTATTACAGAACCTAGGCTCACTTTTCCTTGAGCTTGGTCGTACAACCATGACAATTCGCTTAGGTCAAACACCGTCTGAAGCTGTTGTTAATGCTGGACCTGCAGTTTTCATATCCCCTTCTGGTCCGAATCCTCTCATGGTTCAGGCTCTTCCTTTTCAACCAGGAACTAGCTTTGGTGCCTTTCCCATGGGAACTGTACAGCCTGGATCTGGTTTGGTTAATGGACTTGGGACAGGGCTTCTTCCTAGGCGTATTGATATACAAATAAGAAGAGGTTCATCGGTGGCAACACCCAATGTTAATCGAGAGGAACGTGGTGATACTGCACAACAATCGGGCCAAAGGAACCCATCAATGGGTTCTGGCAGTGAGAATCGTAGCACTCAAACAAGTTCAAGGGTCTCAGATACTCCATCTTTTGCTGGGGAATCAGGAGTGCGGGTAGTGCCAATTAGGACCATGGTTGCGGCTGTACCCACTCCCTTTGGTCGCTTACCGTCAGATTCTTCTGGTAATTCTGTGGGATTATACTACCCATTCCTTGGAAGATTCCAGCACATTGCTTCCGGACATGTTAGTGGGGAACGGGGATCTCAGGGATCTGGTGAGAATCTCTCCCATGGTGTTCAATCTGAGCAGCACCTTATTCCTGAATCTACAGCGCAACAACAAAGTTTCGAAGAGTCAACTAGAGATGGTTCATTGCCAAATCCTAATTCAAGACAACAGGAGCGATCCAATACTCGCAGTGTCAGTATAAACATTCTAGCAGCTGGCCGGACTCAAAACAACCAAGACTCAGAGAGACAAATTCCTAGTAGTGTTCAGTTTCTGAGGGCAATTTTCCCTGGTGGTGAAATCAATGTAGAGGAAGCAAGTGTACAAGGAGCAGCTACAGGTTCTGTCCAAGAGCAAGCAGGGACTTCCAGTGGTGCTCCAGCGGCTGAGCCAAGTATTACTGATCAAGGGGTGTTTTTATCTAATTTGCTTCATCAGATCATGCCATACGTATCTCAGCAAGCAAGTTCACAACAAAGTACTGTGCCTACAGAGGAAGCAAATACTTCCACCCAGGCTGAGCACACTAGTCCTGGGAGTTCACGTAGACCAAGTGACTCTGAACCAAATTCACCAAACTCAAAACGTCAGAAGACAGAGTAG |
Protein: MGSTGADKVPRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASGTSRGHSNHVAPSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANIGSGNIGGDVREHGSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLATLSQYLSHLRREFDGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEVLLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELGRTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGLVNGLGTGLLPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTSSRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGHVSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNTRSVSINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGTSSGAPAAEPSITDQGVFLSNLLHQIMPYVSQQASSQQSTVPTEEANTSTQAEHTSPGSSRRPSDSEPNSPNSKRQKTE |